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Abstract 

In this paper we prove that the simplest band representations of 
unitary operators on a Hilbert space are five-diagonal. Orthogonal 
polynomials on the unit circle play an essential role in the development 
of this result, and also provide a parametrization of such five-diagonal 
representations which shows specially simple and interesting decom- 
position and factorization properties. As an application we get the 
reduction of the spectral problem of any unitary Hessenberg matrix 
to the spectral problem of a five-diagonal one. Two applications of 
these results to the study of orthogonal polynomials on the unit circle 
are presented: the first one concerns Krein's Theorem; the second one 
deals with the movement of mass points of the orthogonality measure 
under monoparametric perturbations of the Schur parameters. 
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1 Introduction 



Matrix representations are an important tool for the study of linear operators 
on a Hilbert space. They allow, for instance, the use of perturbation tech- 
niques for the comparison of operators defined on different Hilbert spaces. 
Besides, the freedom in the choice of the representation can be used to get 
a simple one that can make the analysis of the operator easier. Usually a 
band representation with minimum size band is desirable. A band matrix 
(cjj) is (p, g)-diagonal if Cjj = for i — j < p and j —i > q. A matrix that is 
([§]) [^"H) -diagonal or (pi^], [|])-diagonal, is called a n-diagonal matrix. If 
every operator of a certain class has a n-diagonal representation but not all 
of them have an — 1-diagonal one, we say that n-diagonal representations 
are the minimal representations of the class. 

Concerning the class of self-adjoint operators, any two-diagonal represen- 
tation must be diagonal due to its symmetry, but a diagonal representation is 
only possible in the case of pure point spectrum. Therefore, the minimal rep- 
resentations are at least tri-diagonal. In fact, they are tri-diagonal since, as a 
consequence of the spectral theorem, every self-adjoint operator is unitarily 
equivalent to an orthogonal sum of self-adjoint multiplication operators [2E] 
and, hence, the use of basis of orthogonal polynomials on the real line gives 
a tri-diagonal representation [32] • 

Unitary operators, together with self-adjoint ones, are the most important 
examples of normal operators. However, in spite of their importance, the min- 
imal representations for unitary operators are an open problem. Analogously 
to the self-adjoint case, the study can be reduced to unitary multiplication 
operators, but the use of basis of orthogonal polynomials on the unit circle 
then leads to Hessenberg instead of band representations 121 UZ1 123 EH] • 
As for the possibility of band representations, it has been recently proved 
in [3J that any unitary tri-diagonal matrix decomposes as a sum of 1 x 1 
and 2x2 diagonal blocks and, therefore, it has a pure point spectrum. This 
shows that the minimal representations of unitary operators are at least four- 
diagonal. B. Simon has conjectured in a preliminary version of jSTj that a 
similar decomposition should happen for any unitary four-diagonal matrix, 
which would imply that the minimal representations for unitary operators 
are at least five- diagonal. 

A step to get the minimal representations of unitary operators was taken 
by the authors in [6 . The results presented there imply that any unitary 
operator has a five-diagonal representation. In the next section we intro- 
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duce these five- diagonal representations and their connection with orthogonal 
polynomials on the unit circle. Section 3 is devoted to the study of such rep- 
resentations and their properties. Although Hessenberg matrices have been 
extensively studied, all this analysis will be done jointly for five-diagonal and 
Hessenberg representations for several reasons: 

- It is convenient to understand the improvements given by the five-diagonal 
representations, if compared with the known Hessenberg ones. Some concrete 
examples of the advantages of the five-diagonal representations will be clearly 
shown in the applications discussed in sections 4 and 5. 

- The connections between Hessenberg and five-diagonal representations pro- 
vide an algorithm that reduces the spectral problem of any unitary Hessen- 
berg matrix to the spectral problem of a five-diagonal one ( "five- diagonal 
reduction" of the spectral problem of a unitary Hessenberg matrix). The 
importance of this result is due to the increasing interest in the study of uni- 
tary Hessenberg matrices in numerical linear algebra [TT| IT%1 IT§| and digital 
signal processing applications (see [7] and references therein). 

- The analysis of unitary Hessenberg matrices is the main tool to prove that 
the minimal representations of unitary operators are indeed five-diagonal. 
This result is a consequence of a more general one that closes Section 3: (1, q)- 
diagonal or (p, l)-diagonal representations of unitary operators are possible 
only in the case of pure point spectrum. 

In sections 4 and 5 we consider some applications of the minimal repre- 
sentations of unitary operators to the study of orthogonal polynomials on the 
unit circle. Both applications concern the relation between the support of the 
measure of orthogonality and the corresponding Schur parameters. Section 
4 shows the advantages of the five-diagonal representation for the analysis 
of the limit points of the support of the measure, while Section 5 is devoted 
to the study of the isolated mass points. We finish this last section giving 
several explicit examples of perturbations of the Schur parameters that keep 
an arbitrary mass point invariant. 

Now we proceed with the conventions for the notation. For any subset 
.A of a Hilbert space, A is its closure and span.4 the set of all finite linear 
combinations of A. Also, if S is a subspace of the Hilbert space, A ±s means 
the subspace of S orthogonal to A. 

Given a linear operator T on a Hilbert space, T* denotes its adjoint and 
<t(T) its spectrum, while for every complex matrix M, M T is its transpose 

T 

and M* = M . I and In represent the unit matrix of order infinite and N, 
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respectively. Any matrix of order N is considered as an operator in C , and 
any infinite bounded matrix is identified with the continuous operator that it 
defines in £ 2 , the Hilbert space of square-sumable sequences in C. The inner 
products in and £ 2 are denoted by (•,•), and the corresponding canonical 
basis by {e n }. No misunderstanding will arise from this common notation. 

The term measure always means non-negative finite Borel measure, and, 
without loss of generality, we will consider only probability measures. If fi is 
a measure on a subset of C, supp/i is its support and L 2 ^ the Hilbert space of 
/i-square-integrable complex functions with inner product 



T := {z G C : \z\ = 1} is the unit circle and D := {z G C : \z\ < 1} the open 
unit disk in the complex plane. A multiplication operator on T has the form 



where fj, is a measure on T. 

2 Representations of unitary operators and 
orthogonal polynomials on T 

Given a unitary operator U on a separable Hilbert space TC, the equivalence 
between the following assertions is known |32j : 

• The spectrum of U is simple. 

• U has a cyclic vector v G TC, in the sense of span{U n v } ngZ = TC. 

• U is unitarily equivalent to a multiplication operator on T. 

A standard application of Zorn's lemma shows that any unitary operator 
can be expressed as a (finite or infinite) orthogonal sum of unitary operators 
with cyclic vectors. Therefore, the study of unitary operators becomes the 
study of multiplication operators on T. As for the spectral properties of such 
multiplication operators, it is known that cr(t^) = supp/i, the mass points 
of \x being the eigenvalues of U^. The eigenvectors of associated with an 
eigenvalue A are spanned by the characteristic function X\ of the set {A}. 




1*1 - Ll 



f(z) -» */(*) 
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For a long time, the usual attempts to get matrix representations of 
have dealt with basis constituted by orthogonal polynomials (OP) with re- 
spect to fi, that is, polynomials satisfying 

deg ip n = n, (cp n , <p m )^ = d n7m , n,m>0. (1) 

When supp/i has a finite number N of elements, dim(L^) = N and such 
a basis Q>n : = (fn)n=o comes from the orthogonalization of {z n }^~Q. <J>at 
is called a finite segment of OP associated with /i. If supp/i is infinite, 
dim(L^) = K and the orthogonalization of the infinite set {z n } n >o gives a 
sequence <fr := (ip n )n>o satisfying (0) that is called a sequence of OP with 
respect to fi. However, such a sequence is not always a basis of L 2 ^ since the 
polynomials are not always dense in L 2 

In what follows, tp n denotes the unique n-th OP with respect to \i with 
positive leading coefficient K n . It is known that these polynomials satisfy the 
recurrence relation 

Mz) = i, /gx 

p n <f n {z) =z(p n .. 1 (z) + a n (p^_ 1 (z), n > 1, 

where p*(z) := z n p(z~ 1 ) for a polynomial p of degree n, p n := a/1 — |a n | 2 
and a n G © are known as the Schur parameters associated with /i. 

Besides, when supp/i = {zi, z 2 , . . . , z^}, the same arguments that give 
(j2J) show that the polynomial ijj{z) — (z — Z\){z — z 2 ) ■ ■ ■ [z — z N ) satisfies 

K N ^(z) = Zif N -i{z) + a#¥>jr_i(z)> a N G T. (3) 

It is known that the preceding results establish a one to one correspon- 
dence between: 

• Probability measures supported on N points of the unit circle and 
vectors a N := (d, a 2 , . . . , a^) G B^ -1 x T. 

• Probability measures supported on an infinite subset of the unit circle 

and sequences a := (a n ) neN G D u °. 

If <fr is the sequence of OP related to a measure \x with infinite support, 
from (J2J) we find that the matrix of with respect to $ is a Hessenberg one 
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given by |2l H3 E3 



P1P2P3CI4 ■■■ \ 
aip 2 p3a 4 ■ ■ ■ 
-a 2 p 3 a 4 ■ ■ ■ 
— 0304 • ■ ■ 
Pa 

... j 

where a is the corresponding sequence of Schur parameters. 

The principal matrix of order N of H(&) only depends on the vector a^r 
and will be denoted by H(&n). If a^r £ D^ -1 x T and p is the related finitely 
supported measure we get from (j2J) and (jHJ) that z& N (z) = H(& N ) T <f> N (z) + 
KN-iip(z)eN , *&n being the corresponding finite segment of OP Since 
i/j(z) = /i-a.e., we find that //"(ajy) is the matrix of with respect to $at. 

Apart from its complexity, the infinite matrix H(a) represents the full 
operator only when the polynomials are dense in L 2 ^, that is, when a ^ £ 2 
[inilSS]- In general, H(a.) represents the restriction of to the closure of 
P := span{2 n }„> . Hence, although H(&) is always isometric, it is unitary 
iff zF = P. Since this condition is equivalent to P = L 2 ^, we see that H(sl) is 
unitary iff a ^ i 2 . 

The measures corresponding to sequences ael 2 constitute the so-called 
Szego class. A possibility of getting a matrix representation for in this case 
is to enlarge the OP basis to get an orthonormal basis of L 2 ^. This possibility 
is exploited in [3T], obtaining a doubly infinite unitary matrix in which H(sl) 
is embedded. Anyway, the complexity of the matrix representation remains. 

If we want to simplify the matrix representation of solving at the same 
time the problem for the Szego class, we have to change completely the choice 
of the basis for L 2 ^. Since the space of Laurent polynomials is always dense in 
L 2 ^, a more natural choice for a basis is the orthogonal Laurent polynomials 
(OLP) with respect to p, related to the corresponding OP by [SJ ES] 

X2k(z) = z' k (p* 2k (z), X2k+i = z~ k (p 2k +i(z), k > 0. (4) 

The above relation gives a finite segment of OLP Xat := (Xn)n=o m the 
case of a measure supported on N points, or a sequence X := (x„,)„>o of 
OLP for an infinitely supported measure. X^r and X always constitute an 
orthonormal basis of the corresponding space L 2 ^. 



( -a x 

Pi 



H(&) :-- 



-p\0-2 

-ai<22 

P2 



-P1P2CI3 

-aip 2 a 3 
-a 2 a 3 

P3 
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If the measure p has infinite support, we get from (J2J) the following matrix 
representation for the operator with respect to the related sequence X of 
OLP 



C(a) := 



( -a\ -pia 2 P1P2 \ 
Pi -a x a 2 a 1 p 2 
-p 2 a 3 -a 2 a 3 -p 3 a 4 p 3 p 4 

P2P3 a 2 p3 -a 3 a 4 a 3 p 4 

-p 4 a b -a 4 a 5 -p 5 a 6 p 5 p 6 

p 4 p 5 a 4 p b -a 5 a 6 a 5 p 6 

v ••■ ■• ■• •• ••/ 

a being the corresponding sequence of Schur parameters. 

Now we deal with a five-diagonal matrix that, contrary to the Hessenberg 
one, always represents the full operator and, hence, is unitary for any 
a G D H ° . Besides, it has a much simpler dependence on the Schur parameters. 

The principal matrix of order N of C(a), that only depends on ajy, will be 
denoted by C(ajv). Analogously to the case of the Hessenberg representation, 
if a N G B 7 ^ 1 x T and p is the related measure, C(a N ) is the matrix of 
with respect to the corresponding finite segment of OLP X. N : using (J2J), © 
and (jU) we find that zX.n(z) = C(sln) t ~Kn(z) + bNZ'^^~^(z), where 



b 



N 



Kjv-iejv, if N is even 

K N ^ 1 (p N ^ 1 e N - 1 + a N _ie N ), if AT is odd, 



and, since ip(z) =0 /i-a.e., we get the desired result. 

Let p be a measure on T, ((p n ) the corresponding OP and (x n ) the related 
OLP. As a consequence of the whole previous discussion, if p is associated 
with the sequence a G B H ° of Schur parameters, <r(if(a)) = supp/i for a ^ £ 2 
while cr(C(a)) = supp/i always happens. Also, if p is the finitely supported 
measure associated with a^v G B 7 ^ 1 x T, then <j(H(&n)) = (t(C(&n)) = 
supp/i. Similar relations hold between the mass points of the measure and the 
eigenvalues of the related matrices. As for the eigenvectors associated with a 
mass point A, since (X x ,ip n )^ = /i({A})y? n (A) and (X\,Xn)^ = /i({A})xn(A), 
we find that ^ n (/?„(A)e n is an eigenvector of the corresponding Hessenberg 
matrix when it represents the full operator U^, while Xl„ Xn(A)e„ is always 
an eigenvector of the related five-diagonal matrix. 

Let A be a mass point of p. Using the decomposition of X\ with respect 
to the OLP basis we find that p({\}) = (X X ,X X )^ = p({\}) 2 £ n |x«(A)| 2 
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and, so, since A G T, we get from the relation (@J) between OP and OLP that 
M{^}) = VEn l^n-CA) | 2 . Thus, when \i has infinite support, *fr(A) G £ 2 if A 
is a mass point. Conversely, if A G T is such that 3>(A) G £ 2 , then A is a mass 
point since C(a)*X(z) = lX(z), G C. Notice that these arguments also 
work using the OP basis but restricted to measures outside the Szego class. 

Among other things, the preceding results show that the minimal rep- 
resentations of unitary operators are at most five- diagonal, but, are they 
exactly five- diagonal? 

Moreover, like any unitary operator, every Hessenberg matrix that is uni- 
tary must be unitarily equivalent to a five-diagonal one. However, a question 
remains if we want to complete the "five-diagonal reduction" of the spectral 
problem for any unitary Hessenberg matrix: which one is the five-diagonal 
matrix related to an arbitrary unitary Hessenberg one? 

A deeper study of unitary five-diagonal and Hessenberg matrices will 
answer the above questions. 



3 Five-diagonal and Hessenberg matrices 



The five-diagonal matrices presented in the previous section are examples of 
the following kind of matrices, that can be considered an intermediate step 
between the five-diagonal and the tri-diagonal case. 

Definition 3.1. A (finite or infinite) five-diagonal matrix C = (qj) is called 
para-tridiagonal if c 2 k,2k+2 = c 2 k+i,2k-i = 0, VA; > 1, that is, 



/Ci,i 

c 2 ,i 




C 



Cl,2 
C-2,2 
C3,2 
CA2 



c l,3 
c 2,3 
C3,3 
C4,3 




V 



\ 





C3,4 
C4,4 
c 5,4 
c 6,4 



C3,5 
C4,5 
c 5,5 
c 6,5 





c 5,6 
c 6,6 



c 5,7 
c 6,7 



C is called irreducible if c 2j i 7^ and c 2 k-i,2k+i, c 2 k+2,2k 7^ 0, V7c > 1. 

Unitary irreducible para-tridiagonal matrices, like unitary irreducible Hes- 
senberg ones, have t\ as a cyclic vector (in fact, any vector e n , n G N, is cyclic 
in the first case). Therefore, any unitary irreducible para-tridiagonal matrix 
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is the matrix representation of a multiplication operator on T and, hence, is 
unitarily equivalent to one with the form C(a) or C(ajv). However we do 
not know how to describe this relation exactly. The following result is the 
first step to answer this and the previous questions, since it provides the gen- 
eral form of infinite unitary para-tridiagonal and Hessenberg matrices. The 
matrix representations introduced in the preceding section are indispensable 
guides for taking this step. 

Theorem 3.2. An infinite para-tridiagonal (Hessenberg) matrix is unitary 
(isometric) iff it has the form C(a, b) (H(&,b)), where a, b e C H ° are such 
that \a n \ 2 + |6 n | 2 = 1, Vn G N, and 



( ' —ai —bia 2 
bi —H\a 2 



C(a, b) 







-b 2 a 3 
b 2 b 3 



a x b 2 
-a 2 a 3 
a 2 bj, 




\ 



_0 

-b 3 a 4 
-a 3 a 4 
-b 4 a 5 
6465 



6364 
a 3 b 4 
-O4O5 
a 4 b 5 



_0 

-6 5 a 6 
-a 5 a 6 



Me 
a 5 b 6 



if(a,b) := 



V 

/-ai 



\ 



■J 



-b x a 2 -b x b 2 a 3 -b l b 2 b 3 a 4 
—Hia 2 —H]b 2 a 3 —ajj 2 b 3 a 4 
b 2 —a 2 a 3 —a 2 b 3 a 4 
b 3 -a 3 a 4 
b 4 



\ 



/ 



Proof. An infinite para-tridiagonal matrix C can be written in the way 

(cl c 2 - A 
c 4 ... 



c 






v ... 

It is unitary iff C*C = 



CI 




cc* 



cl 



' C„ G C( 2 ' 2 ), n > 2. 

...j 

/, which is equivalent to 



C\C{ — Ii, 
C n C* + 

0, n>2. 



/ 2 , 



n > 2, 



The first condition means that 



Ci = ( -ai 61 



|ai| 2 + I61I 
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and, then, by induction, we find that the rest of the equations are satisfied 
iff 

C n = _ _ , , a„ + |6„| =1, n > 2. 

y — a n _ia„ a n _iO n y 

This proves the theorem in the para-tridiagonal case. 

Now, let if be a Hessenberg matrix, that is, its n-th column h n belongs 
to spanjei, e%, . . . , e n+ i} C I 2 . H is isometric iff H*H = i, which means that 
{/z n } nS N is an orthonormal set of £ 2 . We will see that this is equivalent to 

h n = b n e n +i - a n v n , ne N 

v n = Yh=i Oi-i&A+i • • • 6„_iei, fieN, (5) 
ao = l; |a„| 2 + |6„| 2 = 1, nGN, 

which proves the theorem for the Hessenberg case. 

First of all, let us suppose that the columns of H have the form (jSJ). 
From the expression of v n we find that t> n+1 = b n v n + a n e n+ i for n G N. 
Therefore, we get by induction that v n -L{hi, h 2 , . . . , /i n _i} and = 1 

for nGN. Then, the expression for h n implies that /i n _L{/ii, h 2 , . . . , 
and (/i n , = 1 for n G N. 

On the other hand, if the columns of H form an orthonormal set of £ 2 , 
then we can write hi = — a%e\ + b\e2, \a\\ 2 + |6i| 2 = 1, and, for n > 2, 
h n = b n e n+1 +u n , b n G C, u n G {/ il ,/ i2 ,...,/in-i} ±spim{ei ' e2 -" e " } - So, h 
has the form given by (JHJ). Moreover, let us suppose that hi, h 2 , ■ ■ ■ , h n _i 
satisfy ©. Then, /i 2 , . . . , An-i} ±span{ei,e2, -' e " } = span{w n } and we find 
that u n = —a n v n , a n G C. The condition (h n , h n ) = 1 gives |a n | 2 + |6 n | 2 = 1. 
This proves by induction that h n has the form (jSJ) for nGN. □ 

A consequence of Theorem K12I is its analogue for finite matrices. The 
result for the Hessenberg case was already known [18] . In what follows, since 
the principal submatrix of order iV of C(a, b) (H(a,b)) only depends on 
ajv, bjv-i, this submatrix will be denoted by C(ajv, bjv_i) (iif(ajv, bjv-i))- 

Corollary 3.3. A finite para-tridiagonal (Hessenberg) matrix of order N is 
unitary iff it has the form C(& N ,h N _i) (H(& N ,b N _i)), where \a n \ 2 +\b n \ 2 = 1 
for 1 < n < N — 1 and \a^\ = 1. 

Proof. This result is just a direct consequence of Theorem 13.21 and the fol- 
lowing facts: a finite square matrix M is unitary iff the infinite matrix M © I 
is unitary; the matrices C(a, b) and if (a, b), with \a n \ 2 + \b n \ 2 = 1, Vn G N, 
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decompose as a direct sum of their principal submatrices of order N and an 
infinite matrix iff = 0. □ 



Remark 3.4. Decomposition property. Unitary para-tridiagonal and isometric 
Hessenberg matrices have similar decomposition properties. They decompose 
as a sum of diagonal blocks iff, for some N, = 0, that is, £ T. 
Moreover, in this situation, the blocks must again be unitary para-tridiagonal 
and isometric Hessenberg matrices, respectively. Therefore, just looking at 
the main diagonal we discover that, if e T, 

C(a) = C(a N ) © C(%aW), aW = (a N+n ) n& , 

C{&n+m) = C{&n) © C(ajva^ ), a^ = (ojv+i, • • • , ajv+Af)- 

Similar results hold for isometric Hessenberg matrices. 
Remark 3.5. Factorization property. For a, 6 G C, let us define 



6(a,6) := 




6„(a, 6) := 7 n _i © 6(a, 6) © I. 



Then, for any bounded sequences a, b e C K ° , 

oo 

C(a, b) = C (a, b)C e (a, b) T , (a, b) = J] 9 n (a n , 6 n ), 

n=l 

where the infinite product, which has to be understood in the strong sense, 
is from the left to the right, and 

C e (a, b) = h © I ®(a 2n , b 2n ) I , C (a, b) = 6(a 2 „_i, 6 2n _i). 

VneN / neN 

These factorizations show explicitly the isometric properties of the matrices 
given in Theorem 13.21 and Corollary 13.31 

We also denote 6(a) := 6 (a, y/l— |a| 2 ), 6 n (a) := I n _i © 0(a) © J, so 
that #(a) — rin^i®™( a ™) an d C( a ) = Co(a)C e (a), where C e (a) := ii © 
(0neN ( a 2n)) and C (a) := neN e(a 2 „-i). 

In the case of finite unitary Hessenberg matrices, the above properties 
have been used for spectral computations ^Hl E3 • Notice that the factor- 
ization property in the Hessenberg case is much worse than in the para- 
tridiagonal one. 
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We know that any unitary matrix represents an orthogonal sum of multi- 
plication operators on T and, hence, is unitarily equivalent to a direct sum of 
unitary irreducible para-tridiagonal matrices. However, if the initial matrix 
is also para-tridiagonal, the equivalence becomes an equality. This is just a 
consequence of Theorem 13.21 Corollary 13.31 and the decomposition property 
given in Remark 13.41 For the same reason, a similar result is also true for 
isometric Hessenberg matrices. 

Corollary 3.6. Every unitary para-tridiagonal (isometric Hessenberg) ma- 
trix is a direct sum of irreducible unitary para-tridiagonal ( isometric Hessen- 
berg) matrices. 

Even more, in the study of irreducible unitary para-tridiagonal (isometric 
Hessenberg) matrices, it is enough to consider those with the form C(a) 
(if (a)) and their principal submatrices. More precisely, we have the following 
immediate result. 

Lemma 3.7. For any a G C N °, b 6 (C\{0})*° it is if (a) = R*H(&, b)R and 
C(a) = S*C(a,b)S, where 



n = i, 

r n +i/r n = b n /\b n \, n>l, 
St = 1, 

S2 = &l/N;_ 

s n+ i/s n _i = & n -A/|frn-A|, n > 2. 

If Rn, Sn are the principal submatrices of order N of R, S respectively, 
H(a N ) = R* N H(a. N ,b N ^ 1 )R N and C(& N ) = S^H(a. N ,b N - 1 )SN- 

Notice that Theorem 13.21 and the above lemma imply that an infinite 
Hessenberg matrix is unitary iff it has the form ii(a, b) with |a n | 2 + \ b n \ 2 = 1 
and a ^ £ 2 . 

The preceding results have the following consequence, that represents the 
"five-diagonal reduction" of the spectral problem for any unitary Hessenberg 
matrix. Without loss of generality we consider only the irreducible case. 



R 



S 



/ n 



r-2 



s 2 



S3 



■■) 
■ 



12 



Theorem 3.8. Let H = (hij) be a (finite or infinite) unitary irreducible 
Hessenberg matrix and let us define 

1, n = l, 

n— 1 

JJ/ifc + i,fc ; n>2. 

k=l 

Then, H is unitarily equivalent to a para-tridiagonal matrix C = (cij) with 
the form C(a) or C(sl n ), where 

n > 1. 



The unitary equivalence is given by H = V*CV, where the columns of V 
(vij) can be recursively obtained by 



' T 



V i,j 



rr^j. 3 = 1,2, 



I t ji 



min{i+2,2j-4} j— 1 

^ c iik v kd -i - ^2 h k j-i v i,k ) , * < 2j — 2, j > 3, 

fc=*~2 fc=min{i,[i±^]} 



,0, i>2j-l, j>3. 



In the above expression the sums have to be understood only over those terms 
in which the matrix coefficients have indices between 1 and the order of H. 
The eigenvectors x(X) = Y, n x nW e n of H and y(X) = ^ n y n (A)e n °f C 
corresponding to the same eigenvalue X are related by 

x 2k -i(X) = p^rVViW, X2k(X) = I^X^y^X), k > 1. 

P2Jfe-l| P2fc| 

Proof. We will consider only the case of an infinite matrix if, the proof for 
the finite case being completely analogous. Then, from Theorem 13.21 H 
must have the form i?(a, b), a G D N °\£ 2 . So, according to Lemma f3.7| H is 
unitarily equivalent to H(a) which, at the same time, is unitarily equivalent 
to C(a) since they are representations of the same multiplication operator. 

We know that H = RH(a)R*, R = (r^j), where T{ = Tj/|Ti| since 
h>i+i,i = h. Besides, if /i is the measure related to the sequence a of Schur 
parameters and (<p n )n>o, (Xn)n>o are the corresponding OP, OLP respec- 
tively, then H(a) = U*C(a)U, U = (iMj), Uij = (ipj-i, Xi-i)u- Therefore, 
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H = V*C(a)V, V = UR*. For j = 1,2, (pj_ x = Xj-i and, so, v itj = rj5 it j. 
For the rest of the columns in V, if j > 2, ipj e spanjl, z, z -1 , . . . , z 1 ^ , zty = 
span{xo, Xi> • • • > X2j-i} and, thus, = for i > 2j. Since R is diag- 

onal, this implies that V{j = for i > 2j — 1, j > 3. Moreover, from the 
equality VH = C(a)V we get 

^J^,fe^fe,j-i = ^ Q,fcffcj-i, * < 2 j - 2, j > 3, 

fc=l k=i-2 

which completes the expression given for Vij, once the restriction Vij = 0, 
i > max{j + 1, 2j — 1}, is taken into account in the above sums. 

Given an eigenvalue A of H = RH(a)R*, the corresponding eigenvec- 
tors are spanned by ^2 n&N r n ip n _i(X)e n , while the eigenvectors of C(a) are 
spanned by XlneN Xn-i(A)e n . Hence, the referred relation between eigenvec- 
tors is just a consequence of the relation (j3J) between OP and OLP. □ 

The para-tridiagonal representations improve the Hessenberg representa- 
tions of unitary operators because of their greater simplicity. Besides, as we 
pointed out in Remark |3.4l they have similar decomposition properties and, 
thus, "Divide and Conquer" algorithms [T!J] can be also developed for the 
spectral problem of a unitary para-tridiagonal matrix. Even more, the fac- 
torization given in Remark 13.51 allows to write the corresponding five-diagonal 
eigenvalue problem equivalently as a generalized eigenvalue problem for a tri- 
diagonal pair of unitary matrices. 

Now we reach the announced result about the minimal representations of 
unitary operators. 

Theorem 3.9. A (p,q)- diagonal unitary matrix is a sum of diagonal blocks 
of order not greater than p + q if p or q are equal to 1 . 

Proof. We can restrict our attention to the case of (1, g)-diagonal matrices 
since, otherwise, we can deal with the adjoint matrix, that keeps the unitarity. 
Also, it is enough to prove that, if such a unitary band matrix has order 
greater than q + 1, then it must decompose as a sum of smaller diagonal 
blocks. Let us suppose that fl is of order greater that q + 1 and does not 
decompose. Q is, in particular, an isometric Hesssenberg matrix and, hence, 
Q = H(sl, b) or Q = H(a N , bjv-i), N > q + 2. Since it does not decompose, 
b n 7^ for all n. Thus, for j > q + 2, uj\j = — ni=i ^fc a i = implies 
dj = 0. Hence, if % < j, u>ij = —a^iaj nl=I &fc = for j > g + 2. Therefore, 
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{Q*ei,Q*e 2 , ■ ■ ■ ,Q*e q+2 } C spanjei, e 2 , . . . , e q+ i}, which is a contradiction 
with the unitarity of Q. □ 



A matrix that decomposes as a sum of finite diagonal blocks has always 
pure point spectrum. Therefore, the previous theorem shows that the only 
(p, g)-diagonal representations possible for any unitary operator are those 
where p, q > 2. Consequently, we have the following corollary. 

Corollary 3.10. The minimal representations of unitary operators are five- 
diagonal. 

4 Krein's Theorem 

One of the advantages of band representations is that they make it easier to 
decide the "smallness" of a perturbation. For example, the compactness of an 
operator is equivalent to stating that the diagonals of a band representation 
converge to 0. This makes it quite simple, for example, to apply Weyl's The- 
orem [SHI I2IH I2H] f° r the invariance of the essential spectrum. Also, it is easier 
to prove that a perturbation belongs to the trace class, which can be used 
to give a simple and elegant operator theoretic proof of Rakhmanov's lemma 
[31] using the Kato-Rosenblum Theorem [2l"l 13*01 ^1 on the invariance of 
the absolutely continuous part of the spectrum of an operator. 

In spite of the difficulties that appear, many results about the orthogo- 
nality measures of OP on T have been obtained using the Hessenberg repre- 
sentation [T21 H31 EH "HH HHj, mainly due to the efforts of L. Golinskii. The 
proofs of such results can be now simplified, but we want to show some new 
results and advantages provided by the para-tridiagonal representation in the 
analysis of the relation between measures and Schur parameters. 

First of all, we will discuss the advantages found in the application of 
Krein's Theorem [Tj, getting new results for discrete measures whose sup- 
port has a finite derived set. Krein's Theorem asserts that, given a measure 
H with infinite support, it is equivalent to saying that supp/i accumulates 
on the finite set {wi,W2, ...,Wn} and that the operator Pn(U^) is compact, 
where pn(z) = Yli=i( z ~~ w d- This theorem was established in 1962 by N.I. 
Akhiezer and M.G. Krein ("Q for measures on the real line with finite mo- 
ments. Recently, the translation to the unit circle was given by L. Golinskii 
[T2"j . who succeeded in characterizing in terms of the Schur parameters the 
measures whose support has one or two limit points, using the Hessenberg 
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representation of U^. He also proved that the Schur parameters of any mea- 
sure on T whose support has a finite number N of limit points must satisfy 



limp n p n+ i • ■ ■ Pn+N-i = 0, 



(6) 



n 



from which comes the property lim \ a n \ = 1 for any measure whose support 



has a finite derived set. 

However, with the Hessenberg representation it is hard to go further in 
this direction. The para-tridiagonal representation makes things easier, not 
only because of its band structure, but also due to its factorization properties. 
In the context of the para-tridiagonal representation, for the application of 
Krein's Theorem it is necessary to decide the compactness of p7v(C(a)), where 
a is the sequence of Schur parameters associated with \i. This requires the 
calculation of the 4iV + 1 diagonals of pjv(C(a)), some of them possibly 
giving redundant information. We can optimize the calculations using the 
factorization of Remark 13.51 

Proposition 4.1. Given Wi,w 2 , ■ ■ ■ ,wn G T, let us define 



where Pn(z) = n^il- 2 ~ w i)- Then, 5^(C(a)) is a 2N + 1-diagonal matrix 
such that 



and p7v(C(a)) is compact iff lim n q N (C(a)) n+rri)n = for m — 0, 1, . . . , N. 

Proof. From the unitarity of C(a) and C (a), the equivalence between the 
compactness of p7v(C( a )) and g7v(C( a )) follows. The matrix q N (C(a)) is a 
linear combination of products of, at most, iV tri-diagonal matrices, so, it 
is 2N + 1-diagonal. Thus, gjv(C(a)) is compact iff lim n q^{C{a)) n+rn ^ n = 
for m — 0, ±1, . . . , ±N. Hence, to finish the proof we just have to check the 
relations between q N (C(a)) and gjv(C(a))*- 

When is odd, gjv(C(a)) is a linear combination of products of an odd 
number of alternate factors C G (a) and C e (a), or their adjoints. Since C (a) 
and C e (a) are symmetric, gjv(C(a)) is symmetric too. 



n 






if N is odd, 
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In the case of even N = 2k, we can write g7v(C( a )) = n*=i r i(^ , ( a ))> 
rj(C(a)) = (C(a) — Wj)C(a)*(C(a) — Wk+i)- The result is just a consequence 
of the fact that r;(C(a))* = uJjW fc+ jrj(C(a)). □ 

Therefore, we can apply Krein's Theorem imposing only that the main 
and lower diagonals of gjv(C( a )) converge to 0, which will give in general 
N + 1 asymptotic conditions for the Schur parameters of a measure whose 
support has N given limit points. For illustrative purposes we present the 
results achieved using this procedure when applied to the characterization of 
measures whose support has up to three limit points. 

Proposition 4.2. Let p be the measure associated with the sequence a of 
Schur parameters. Then: 

1. {suppp}' = {a} iff 
\im n (a n a n+ i + a) = 0. 

2. {supp/i}' C {a, 13} iff 
lim„p„p n+ i = ; 

lim„ p n+ i (a n a n+2 - af3) = 0, 
lim„(a n a n+ i + a(3a n a n+1 + a + (3) =0. 

3. {suppp}' C {a, (3, 7} iff 

lim„ PnPn+lPn+2 = 0, 

lim n p n+1 p n+2 (a n a n+3 + a/3j) = 0, 

lim n p n+ i(a n a n+ i + a n+1 a n+2 - af3^a n a n+2 + a + (3 + 7) = 0, 
\im n (a n al +1 - p 2 n+1 a n+2 + a(3^(a 2 n a n+ i - a n -ip 2 n ) + 

+ (a/3 + /?7 + 7a)a n + (a + (3 + j)a n+ i) = 0. 

The first result of the above proposition is the same one obtained in [TJ] , 
but the second assertion simplifies the one given in ^2]- Notice that the 
relations given in the two last cases of the proposition also imply 

limp n+ i (a n a n+ i + a n+1 a n+2 + a + f3) = 0, 

n 

if {supp/i}' C {a,f3}, while, for {suppp}' C {a, (3, 7}, 

limp n+ ip n+2 (a n a n+ i + a n+1 a n+2 + a n+2 a n+3 + a + (3 + 7) = 0. 

n 
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The above results suggest the following improvement of the property (JHJ) 
that gives a common feature for measures whose support has a finite derived 
set. 

Theorem 4.3. Let fi be the measure associated with the sequence a of Schur 
parameters. If {suppp} C {wx, W2, ■ ■ ■ , %}, then 

N 



limTTp n+4 = 0, N>1, 

n ■*■ ■*■ 

i=l 

N-l 

lim (a n a n+N - pj JJ p n+i = 0, N > 2, 
i=i 

(N \ iV-1 

y^q n+ j_ia n+ j + 5 J p n+i = 0, iV > 2, 
j=i / i=l 

(N-l \ AT-2 

^ a n+i _ia n+i + Pa n a n+N _i + ^ J II P«+* = °> ^ > 2, 
i=l / i=l 

where P := (— l) iV wiW2 • • • Wat a«d S 1 := wi + u>2 + • • • + Wn- 

Proof. We will consider only the case of even N = 2k, since the analysis for 
odd N is analogous. Then, the operator qisr(C(a)) associated with the points 
Wi, w 2 , • • • , wn, given in Proposition 14. H has the form 

q N (C{a)) = C{af - SC(a)^ 1 + SPC(a)**- 1 + PC(a)* k . 

For N - 3 < m < N, q N (C(a)) n+m , n = q$ (C(a)) n+m>n , where 

g$} ) (C(a)) = C(a) fc - ^(a)*" 1 - 5PC(a)* fe - 1 + PC(a)* fc . 

So, lim n g^- ) (C(a)) n+mjn = 0, iV — 3<m<iV, under the hypothesis for ji. 

Let us examine the coefficients qff (C(a)) n+m>n iov m — N, N — 1, N — 2. 
We can write C (a) = A D + ^P + P D K* and C e (a) = A e + + P e K*, V 
being the right shift, defined by Ve n = e n+ i, and 

. _ f -a n e n , odd n, . j a n _ie„, odd n, 

[ a n _ie n , even n, [ -a n e n , even ra, 

o j Pne n , odd n, _ f 0, odd n, 

U, even n, [ p n ^n, even n. 
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Taking into account that V, V* rise and lower the indices of the vectors 
e n , respectively, V*e 1 = 0, and B Q , B e vanish over vectors with even and odd 
index n, respectively, we find that 

(gS } (C(a)) en ,e n+w ) = (((VB VB e ) k + P(VB e VB ) k )e n , e n+N ) = 

f PnPn+i ■ ■ ■ Pn+N-i, even n, 

" I PpnPn+1 ■ ■ ■ Pn+N-U odd 71, 

(q^\C(a))e n ,e n+N ^i) = 

= {{{VB VB e ) k - l VB A e + A VB e (yB VB e f- x )e w e n+JV _i) + 

+P(((VB e VB ) k - 1 VB e A* + A* e VB (yB e VB ) k - x )e w e n+N ^) = 

~Pn ■ • ■ p n +N-2(a n +N-i ~ Pon-i), even n, 
-Ppn ■ ■ ■ Pn+N-2(a n +N-i - Pa n -i), odd n. 

Therefore, lim„ [Tjli Pn+i = and lim n (a n+N - Pa n ) Y[^=i Pn+i = 0, which 
is equivalent to the first and second equalities of the theorem. 

Concerning the coefficients (q^\C(a))e n , e n+ N- 2 ), we have that 

(C(a) fc e n , e n+N _ 2 ) = ((VB VB e ■ ■ ■ VB VB e A A e )e n , e n+N „ 2 ) + 

+ ((VB VB e ■ ■ ■ VB A e A VB e )e n , e n+N ^ 2 ) + ■■■ 
■■■ + ((A VB e ■ ■ ■ VB VB e VB A e )e n , e n+N . 2 ) = 

~Pn ■ ■ ■ Pn+v-3(o„_ia n H h a n+N - 3 a n+N - 2 ) , even n, 

—Pn ■ ■ ■ Pn+N-3 a n-ia n +N-2, odd 71, 

(C(a)* fe e n , e n+N _ 2 ) = {{VB e VB ■ ■ ■ VB e VB A* e A* )e n , e n+N _ 2 ) + 

+ ((VB e VB ■ ■ ■ VB e A* A* e VB )e n , e n+N _ 2 ) + ■■■ 
■■■ + ((A* e VB ■ ■ ■ VB e VB VB e A* )e n , e n+N „ 2 ) = 

-p n • ■ ■ p n+ Ar_ 3 a n _ia n+A r_2, even n, 

-Pn ■ ■ ■ Pn+N-3( a n-ia n + ' ' * + a n+ n -3 a n+ N -2) ■, odd 71, 

(C(a) fc - 1 e n ,e n+JV _ 2 ) = {{VB^B.f-^, e n+JV - 2 ) = 

_ f PnPn+l ■ ■ ■ Pn+N-3, even 71, 

~ \ 0, odd n, 

0, even n, 

PnPn+l • • • Pn+N-3, odd 71. 
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From these results the last equality of the theorem follows. The third relation 
is just a consequence of the other ones. □ 



5 Perturbations of the Schur parameters and 
mass points 

In the previous discussion we have exploited the band structure and factor- 
ization properties of the para-tridiagonal representation. Now we will also 
show the advantages of its simple dependence on the Schur parameters, in 
particular, of the fact that, contrary to the Hessenberg representation, any 
Schur parameter appears in only a finite number of elements of the para- 
tridiagonal representation. 

The application to the study of OP of standard results of operator theory, 
like the Weyl, Krein or Kato-Rosenblum theorems, gives information about 
the limit points of the support of the orthogonality measure. However, for 
the analysis of isolated mass points other tools are more appropriate. This 
last section illustrates the usefulness of the para-tridiagonal representation 
for this purpose too. Our aim is to study the behaviour of the isolated 
mass points of the measure under monoparametric perturbations of the Schur 
parameters using the Hellmann-Feynman Theorem [HI EHJ EU E3] • 

Let us suppose a sequence a(t) G D H ° depending on t 6 I, where I is an 
interval of R. A measure //* corresponds to each value of t. The related OP 
and OLP sequences will be denoted by <&* := (v?n)n>o an d X* := (Xn)n>o 
respectively. 

Besides, let u(t) be a function of t 6 / with values on T. For each 
t we can consider the finitely supported measure fi% corresponding to the 
parameters (di(t), . . . , ajv-i(t), u(t)), whose finite segments of OP and OLP 
are respectively & N := {tp\^ n ~£ and ~K l N := (^)^Tq 1 . The importance of 
such discrete measures is that they weakly converge to /x* and, thus, they 
provide the, so called, Szego quadrature formulas [24J for the measure //. 

We are interested in the evolution with t of the isolated mass points of 

that is, the isolated eigenvalues of C(t) := C(a(t)). We will also analyze 
the movement of the mass points of the discrete approximations that is, 
the eigenvalues ofCjv(i) := C(ajv(£)), a^it) := (ai(t), . . . , ajv-i(i), u(t)). 

Since the finite matrices Cjv(t) have N different eigenvalues, in any in- 
terval where Si N (t) is different iable with respect to t, its eigenvalues are 
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differentiable functions A(t) j2Hl- Moreover, the corresponding eigenvectors 
~X t N (X(t)) are also differentiable in t, since Xjv(<z) is a differentiable function 
of a±, . . . , Oiv-ij z. 

Concerning the infinite matrix C(t), a similar result holds, but only lo- 
cally. More precisely, let us suppose that C(t) is differentiable in norm with 
bounded derivative C(t) and ||C(t)|| locally bounded. Then, if Ao is an iso- 
lated eigenvalue of C(t ), there exists a neighbourhood of t where C(t) has 
an isolated eigenvalue \(t) which is differentiable and such that A(£o) = Ao- 
Moreover, a related eigenvector can be chosen as a strongly differentiable 
function of £ in a neighbourhood of to |22]- 

This last discussion justifies the following lemma. 

Lemma 5.1. Let Q(t) = (<^i,j{t))ijes be a bounded band matrix depending 
on a parameter t 6 I, I being an interval ofM.. Assume that the coefficients 
u)i,j(t) are twice differentiable and sup ij6N l^i^WI; su Pijgn l^ijWI are locally 
bounded on I . Then, Q(t) is differentiable in norm with bounded derivative 
Q(t) := (^ij(t))ij eN and \\Q(t)\\ is locally bounded on I. 

Proof. We can write Q(t) = Q (t) + £f =1 (^ & (*) + ^-k(t)V* k ), where 
Qk(t), \k\ < N, are diagonal matrices and V is the right shift. Hence, 
if the statement is true for the matrices fifc(i), it is also true for Q(t). 
So, we just have to check the proposition for a diagonal matrix Q(t) = 
diag(u;i(t), u 2 (t), . . .). If u n (t) are differentiable and sup ngN |u; n (i)| is lo- 
cally bounded on J, Q(t) := diag(co , 1 (t), u 2 (t), . . .) is bounded with = 
sup ngN |u; n (t)| locally bounded on /. If, besides, u n (t) are twice differentiable 
and sup ngN |u; n (£)| < K in a neighbourhood of to, using the mean value 
theorem we get 



t-tn 



sup 

ngN 



U n {t) - U n (t ) i , N 
W nl £ 0, 



t-tr 



< K\t-t 



o 



for t in such a neighbourhood. This proves the differentiability in norm. □ 

Now we can state the following result for a differentiable monoparametric 
perturbation of the Schur parameters. 

Proposition 5.2. Let a n : I ^ D be differentiable for n G N. Then: 

1. If u: I — > T is differentiable, the mass points of are differentiable 
functions A: I — > T satisfying 

x\t) = Mm}) xumfc'At) nm)- 
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2. If a n : I — > D twee differentiable for n G N and sup ngN |a^(t)|, 
sup ngN |a n (t)|, sup ngN |p n (t)|, sup ngN |p n (t)| are locally bounded on I, 
for any isolated mass point Ao 0/ there exists a differentiable func- 
tion X: J — > T on a neighbourhood J of t such that A(i) an isolated 
mass point of /J for t G J and A (to) = Ao- 77ms function satisfies 

X'(t) = ^{X(t)})X\X(t)) T C\t)X^{XW)- 

Proof. From the previous discussions and Lemma 15.11 we find that the re- 
ferred differentiable functions X(t) exist under the conditions of the theorem. 
The expression for A (t) follows from the Hellmann-Feynman Theorem for 
normal operators. Let us consider the infinite case since the analysis of the 
finite case is analogous. The mass points X(t) are simple eigenvalues of C(t) 
with associated eigenspace spanned by X t (X(t)). We know that there exists 
a strongly differentiable eigenvector Y(t) of C(t) with respect to X(t). There- 
fore, just differentiating the equality Y(t)*C{t)Y(t) = X{t)Y(t)*Y(t) and 
bearing in mind the unitarity of C{t), we get Y(t)*C'(t)Y {t) = X'(t )Y(t)*Y(t). 
This relation is also true when substituting Y{t) by X t (X(t)) since they 
are proportional. The statement 2 is then a consequence of the equality 
X\X(t)fX^Xtt)) = Enen l^n(A(t))| 2 = l//A{A(t)}). □ 

It is natural to expect a qualitatively different behaviour of the mea- 
sure under rotations or dilatations of the Schur parameters. Therefore, it 
could be interesting to examine the preceding result when we decompose 
the monoparametric perturbation in the way a n (t) = r n {t)e %an ^\ r n (t), a n (t) 
being real functions. 

Theorem 5.3. Let a n {t) = r n (t)e ia " {t \ where r n : I -> (-1,1), a n : I -> R 
are differentiable for nGN. We define the functions 

I* B (*) := Im(e-^V-*(^_i(z)) 2 ) , ^N, 

K{z)--=Wn-i{z)?-mz)\ 2 , neN. 

1. If u(t) = e l ^\ being (5:1 K differentiable, the mass points of f/ N 
have the form A(t) = e %e<yt \ where 9: 1 — > WL is a differentiable function 
that satisfies 

2V-1 

e'it) = ^({A(t)}){ (r'nitKW)) + * n (t)K(Ht))) + 

n=l 

+(5\t)^N-i(X(t))\ 2 }- 
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2. Let r n : I — > (—1,1), a n : / — > R be twice differentiable. Assume that 
sup ngN \r' n (t)\, sup n6N |r^(i)|, sup neN \a' n (t)\, sup neN \a' n (t)\ are locally 
bounded on I and there exists r < 1 such that \r n (t)\ < r whenever 
r n (t) 7^ 0. Then, if Xq is an isolated mass point of fi to , there exists 
a differentiable function A: J —>■ T on a neighbourhood J of to such 
that X(t) is an isolated mass point of /x* for t 6 J and A (to) = -^o- 

zs a differentiable function that satisfies 

oo 

e\t) = Am)}) E (^)it(A(O) + <£(fK(A(f))) . 

n=l 

Remark 5.4. Notice that the above series converges due to the suppositions 
about the sequences (r n (t)) ne ^, (a n (t)) nS N and the fact that <&*(A(t)) G £ 2 
since A(i) is a mass point of //. 

Proof. The conditions given for the sequences (r n (t)) n£ N, («n(t))neN an d the 
function (3(t) are enough to apply Proposition l5.21 Therefore, the referred dif- 
ferentiable functions X(t) exist. Since R is the universal covering space of T, 
with the imaginary exponential as a covering map, there exists a unique con- 
tinuous real valued function 9(t) such that A(£) = e ld ^\ 9 (to) = Arg(A(to))- 
Moreover, the imaginary exponential is locally invertible with differentiable 
inverse, so, 9(t) must be differentiable too. From Proposition 15.21 we know 
that 

The rest of the proof is just the calculation of the right hand side of 
the above expression, which we will do only for the infinite case since the 
arguments in finite case are similar. We can easily do this calculation using 
the factorization C(t) = C Q (t)C e (t), C Q (t) = C (a(t)), C e (t) = C e ( a(t)), given 
by Remark EH1 As a co nsequenc e of © and (gj), C e (t)X*(A(t)) = X*(A(t)) 
and C (t)X'(A(t)) = A(t)X*(A(t)). Therefore, 

X\\(t)) T C'(t) = X\\(t)) T C' (t) X\\ (t)) + 

+\(t)X t (\(t))*C' e (t)Xt(\(t)), 

which, using (J3J), gives 

oo 

9'(t)= i ^({X(t)}) E A (*)~ n {^(*)^-i( A (*)) 2 -<(0^(A(i)) 2 - 

n=l 

-2 P :(t)^_ 1 (A(t))^(A(t))}. 
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Finally, the expression given in the theorem for 9 (t) follows from the above 
one, taking into account (J2J) and the relations 

a' n (t) = r' n (t)e^ + ia' n (t)a n (t), p n {t) = r' n (t). 

Pn{t) 

□ 

From the above theorem we directly get a bound for the angular velocity 
of the isolated mass points. 

Corollary 5.5. Under the conditions of Theorem \5., 6 Jl 



2 

1 — r' 2 n >\ ' " n>l 



\0'{t)\ < - ^ sup|7Y,(t)| + sup|a4(t) -a n _ x (t)l 



where «o = and, in the case of p} N , the sums run from 1 to N and = (3. 

The particular case of uniform rotations of the Schur parameters is spe- 
cially interesting. It has been previously considered in [15^ and by the authors 
in jl El El- 
Corollary 5.6. Let a G u G T and a: I — > R differentiate. If a n (i) = 
a n e ia ^' for n G N and u(t) = ue ia ^\ then: 

1. The differentiable arguments of the mass points of f/ N satisfy 

e'(t)=»%({\(t)})a'(t). 

2. If a" (t) exists and is locally bounded on I, the differentiable arguments 
of the isolated mass points of //* satisfy 

6'(t)=S({\(t)})a'(t). 

Proof. Apply Theorem E21 to r„(t) = \a n \, a n {t) = a(t) + Arg(a n ) and f3{t) = 
a(t) + Arg(u). Notice that a (t) is locally bounded on / if a(t) is twice 
differentiable. □ 

This result is the generalization to arbitrary measures of the one founded 
in [5] for finitely supported measures using the Hessenberg representation. 
It says that under a uniform rotation of the Schur parameters the isolated 
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mass points of the corresponding measure rotate in the same direction and 
the mass of each point gives its relative angular velocity with respect to the 
angular velocity of the Schur parameters. Therefore, a mass point rotates so 
much more quickly with the Schur parameters as its mass gets bigger. In fact, 
Theorem 15.31 suggests that, in general, the mass of an isolated mass point 
gives a measure of its instability under perturbations of the Schur parameters. 

The study of the relation between Schur parameters and measures implies 
the attempt to find families of Schur parameters associated with measures 
with some common features. Theorem 15.31 opens a way to find monopara- 
metric families of Schur parameters whose measures have a common mass 
point. Among the ways to do this, we will just select some of them. 

5.1 Measures with a fixed mass point 

Let \x be the measure corresponding to a sequence a = (r n e"* n ) ng N of Schur 
parameters and (y? n )n>o the associated OP. If A = e td is an isolated mass 
point of fi, our aim is to find monoparametric perturbations a(t), a(i ) = a, 
such that the corresponding measures /x* have the same mass point, at 
least in a neighbourhood of to. We will also consider the analogous prob- 
lem for the finitely supported measures hn associated with the parameters 
(ai, . . . , a N _i, u), u — e llS . In what follows we suppose that the perturbation 
satisfies the conditions given in Theorem 15.31 



This case corresponds to the perturbation of only the n-th Schur param- 
eter. So, the first n OP coincide with the unperturbed ones. Using (0) we 
get from Theorem 15.31 that A = e ld is a fixed mass point if 





r\t) Im(e- 4a WA 2 -"(^ n „i(A)) 2 ) 



a'(t)r(t) {Re(e— WA^^.^A))) +r(t) 2 |^ n _ 1 (A)| 2 } . 



If (<£> n _i(A)) 2 = \ipk(X)\ 2 e 1 ^, the above equation becomes 



d(r sm(a + (n - 2)9 - £)) + r 2 da = 0, 
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whose solution for the conditions r(t ) = r n , a(t ) = a n is 



sm c 



sin(o; + (n - 2)9 - £ - c) 

r n sin(a n + (n - 2)9 - £) 



c = arctan 



1 + r n cos(a n + (n — 2)9 — £) 



The same solution appears in the case of a finitely supported measure /ijv, 
N > n, if we leave the parameter u unperturbed. 

{a>k if k < n, 

r(t)e ia ^ i£k = n, (r(t ) = r n , a (to) = «„) 
e i(a(t)-a„) Qfe if jfc > n- 

Again, the first n OP coincide with the unperturbed ones. The condition 
given by Theorem 15. HI for a fixed mass point A = e %e is now 

2dr ,„ 

r sinfa + fn — 2)6' — £) — eta = 0, 

1 — r 2 

where £ is the phase of (y9 n _i(A)) 2 . The solution for the conditions r(t Q ) = 
r n ,a(t ) = a n is 

sin±(a + (n-2)0-f-c) 
T ~ sin±(a + (n-2)0-f + c)' 

c = 2 arctan ( tan — (a„ + (n — 2)0 — £) 

\l + r n 2 V V ; ^ 

This solution remains valid in the case of a measure fi^, N > n, if we also 
include a perturbation u(t) = e^™® - ™"^ of the parameter it. 

{afc if < n, 

r(t)e ia " iik — n, (r(t ) = r n , a(t ) = 0) 
e ia{t) a k if k > n. 

As in the previous cases, the first n OP coincide with the unperturbed 
ones. From Theorem 15 . 31 and using (J2J) we find that the perturbations of this 
type with a fixed mass point A = e ld are characterized by 

2 sin(a n + (n - 2)9 -g)dr-(l + 2r cos(a n + (n - 2)9 - f ) + r 2 ) da = 0, 
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where, again, £ is the phase of (</>„_! (A)) 2 . The solution for the conditions 
r(t ) = r n ,a(t ) = is 



r = 



sin(|a + a n + (n — 2)9 — £ — c) 



c = arctan 



sin(~a — c) 
/ sin(a n + (n - 2)6* - £) 



) 



\r n + cos(a„ + (n - 2)0 - £) 



which is also valid in the case of a measure pn, N > n, if including a 
perturbation u(t) = e ia ^u of u. 

If //° has an isolated point at A = e , the previous relations between r 
and a provide perturbations of the Schur parameters that give families of 
measures with the same mass point A, at least for r, a in a neighbourhood of 
r(t ), a (to)- In the case of a finitely supported measure this neighbourhood 
is only restricted by the condition \r\ < 1. 

The simplest case of the above perturbations happens when n = 1, where 
always £ = 0. Another particularly simple situation is the perturbation 
of a Geronimus measure, that corresponds to a constant sequence of Schur 
parameters. 

Example: perturbations of Geronimus measures with a fixed mass point. 

Let us consider the measure corresponding to a constant sequence of Schur 
parameters a n = a e D\{0}, n > 1 ^0]. This measure has an isolated mass 
point at A = (1 - a)/(l - a) if |a - 1/2| > 1/2, that is, if Re(a) < |a| 2 . 
The related orthogonal polynomials are ^(z) = p~ n (u n+ i(z) — (1 — a)u n (z)), 
where p = a/I — |a| 2 , «„(z) = (it7i(z) n — w 2 (-2) n )/(wi(2;) — 102(2)) and Wi(z), 
w 2 {z) are the solutions of w 2 — (z + l)w + p 2 z = [TTj. If A = e i61 , the 
phase of u>i(A) and w 2 (A) is 9/2 and, thus, (n — 1)9/2 is the phase of u n (X). 
Hence, £ = (n — 1)^ and a + (n — 2)9 — £ = a — 9 for all n. In this case, the 
relations between r and a that give a fixed mass point for the three previous 
perturbations are independent of the index n of the Schur parameter where 
the perturbation starts. 

Let us write a = r e ia °, r , a$ G R. Then, the condition for the existence 
of an isolated mass point is cosao < tq. Using the explicit form of the mass 
point we find that 



(1 + r 2 ) cos« - 2r 
1 + Tq — 2r cos ao 



(1 - r 2 )sina 



cos^ao - 



sin(a — 



1 + rg — 2r cos ao 
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Taking into account these expressions we can find explicitly the relations 
between r and a that give a fixed mass point at A = ( 1 — a) / ( 1 — a) in the 
case of the three perturbations previously studied. We find the following 
results: 



Case 1. r 



Case 2. r = 



Case 3. r 



r sin «o 



sin a — r sm(a — c*o) 

r sin |(3a:o ~ a ) + si* 1 |( a — a o) 
sin |(a + a) — r sin |(a — a ) 

r sin |(2a — a) + sin |ct 
sin |(2a — a) + r sin |ct 



Notice that in the first and second cases r = r for a = oto, due to the 
initial conditions r(t ) = r , a(t ) = a , while in the third case r = r for 
a — 0, since r(t ) = ?"o; «(^o) — 0- If — 0, we can choose a(t) = «o + 1 in 
the first two cases and a(t) — t in the third one. Then, as a consequence of 
the previous results, we find that, in a neighbourhood of t — 0, the following 
families &(t) of Schur parameters are related to measures with a common 
mass point at A = (1 — a)/(l — a) (we assume a e D\M and Re(a) < |a| 2 ): 

if A; 7^ n, 

^ L ° fc(t) = i T _,^,„,, 7^—. e U a, if k = n. 

ii k < n, 

Case 2. a fc (t) = <( w ^ ~ ~T TTTii "p^TT^fl e * <a ' if fc = n ' 

if > n. 
if k < n, 

Case 3. a k (t) = ^ YZj^TZZ t' 2 , 7i7i2 — pTTTri ^TTl a ' ^ ^ = n > 

if > n. 



=1 


[ a, 


Im(a) 






[ Im(a) cost 


-(l«| 2 


g 

— Re(a)) sint 




f a, 

! Im(a) cos 




- Re (a)) sin | 


■I 


1 Im(a) cos | 


— (I«l 2 


— Re(a)) sin | 




f a, 

1 Im(a) cos 




- Re(a)) sin | 




I Im(a) cos | 


+ (M 2 


— Re(a)) sin | 
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